Evaluation of Ancient Documents and Images by using Phase Based Binarization

نویسندگان

  • K. SRUJANA
  • VINOD R KUMAR
چکیده

Segmentation of text from badly degraded document images are a very challenging task due to the high inters/intravariation between the document background and the foreground text of different document images. These features are the maximum moment of phase congruency covariance, a locally weighted mean phase angle, and a phase preserved denoised image. The proposed model consists of three standard steps: 1) preprocessing; 2) main binarization; and 3) postprocessing. In the preprocessing and main binarization steps, the features used are mainly phase derived, while in the postprocessing step, specialized adaptive Gaussian and median filters are considered. One of the outputs of the binarization step, which shows high recall performance, is used in a proposed postprocessing method to improve the performance of other binarization methodologies. Finally, we develop a ground truth generation tool, called PhaseGT, to simplify and speed up the ground truth generation process for ancient document images. The comprehensive experimental results on the DIBCO’09, HDIBCO’10, DIBCO’11, H-DIBCO’12, DIBCO’13, PHIBD’12, and BICKLEY DIARY data sets show the robustness of the proposed binarization method on various types of degradation and document images. Experiments on the Bickley diary dataset that consists of several challenging bad quality document images also show the superior performance of our proposed method, compared with other

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phase-Based Binarization of Ancient Document Images

The main defects present in historical documents are darkness, non-uniform clarification, bleed-through and faded characters. To remove these defects binarization method is used. In this paper a phase based binarization method is studied in which phase of ancient document images is preserved. This method is derived in to three steps: preprocessing, main binarization and post processing. In prep...

متن کامل

Ancient Document Images Enhancement Using Phase Based Binarization

In this paper, we present a phase-based binarization model for degraded document images, also a post processing method that can improve any binarization method and a ground truth generation tool. Usually, many binarization techniques are implemented in the literature for different types of binarization problems. It include an adaptive image contrast based document image binarization technique t...

متن کامل

Comparison of Niblack inspired binarization methods for ancient documents

In this paper, we present a new sliding window based local thresholding technique ‘NICK’ and give a detailed comparison of some existing sliding-window based thresholding algorithms with our method. The proposed method aims at achieving better binarization results, specifically, for ancient document images. NICK has been inspired from the Niblack’s binarization method and exhibits its robustnes...

متن کامل

Binarization Of Ancient Document Images

Ancient documents accumulate a significant amount of human heritage over time. However, many environmental factors, improper handling, and the poor quality of the materials used in their creation cause them to suffer a high degree of degradation of various types. There are lots of ancient documents which are badly degraded. It is very difficult to segment text from the document, as there is a v...

متن کامل

Improving Degraded Ancient Document Images Using Phase-based Binarization Model

Here presenting a phase-based binarization model for ancient document images, and also a post processing method which can improve any binarization method and a ground truth generation tool. Three feature maps derived from the phase information of an input document image form the core of this binarization model. These features are the maximum moment of phase corresponding to covariance, a locall...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015